# ONNX Optimization

Parakeet Tdt 0.6b V2 Onnx
NVIDIA Parakeet TDT 0.6B V2 is a model based on automatic speech recognition (ASR) tasks, suitable for English speech-to-text tasks.
Speech Recognition English
P
istupakov
129
3
Dinov2 Base ONNX
This is the ONNX format version of the facebook/dinov2-base model, suitable for computer vision tasks.
Transformers
D
onnx-community
19
0
Xlm Roberta Base Language Detection Tfjs
MIT
This is a multilingual detection model based on XLM-RoBERTa, supporting recognition of 20 languages.
Text Classification Supports Multiple Languages
X
dnouv
14
1
Moonshine Tiny ONNX
MIT
Moonshine Tiny is a lightweight automatic speech recognition (ASR) model suitable for embedded devices and edge computing scenarios.
Speech Recognition Transformers
M
onnx-community
60
6
Prompt Injection Defender Large V0 Onnx
TestSavantAI models are a set of fine-tuned classifiers specifically designed to defend against prompt injection and jailbreak attacks targeting large language models (LLMs).
Text Classification Transformers English
P
testsavantai
3,225
1
Prompt Injection Defender Large V0
The TestSavantAI model is a set of classifiers specifically designed to defend against prompt injection and jailbreak attacks in large language models (LLMs). The tiny version is based on the BERT-tiny architecture, balancing security and computational efficiency.
Text Classification Transformers English
P
testsavantai
23
2
Granite Timeseries Patchtsmixer
A time series forecasting model based on the PatchTSMixer architecture, developed by IBM, suitable for multivariate time series forecasting tasks.
Climate Model Transformers
G
onnx-community
181
0
Microsoft Speecht5 Tts ONNX
This is the ONNX format conversion of Microsoft's SpeechT5 text-to-speech (TTS) model, optimized for Transformers.js
Speech Synthesis Transformers English
M
eligapris
46
1
Whisper Large V3 Turbo
An ONNX-optimized Whisper large speech recognition model designed for web deployment
Speech Recognition Transformers
W
onnx-community
2,988
54
Bart Large Cnn
MIT
This is an ONNX-optimized version of the facebook/bart-large-cnn model, primarily used for text summarization tasks.
Text Generation Transformers
B
philipp-zettl
15
0
Timesformer Hr Finetuned K400
TimeSformer-HR is a high-resolution spatiotemporal Transformer model for video, fine-tuned on the Kinetics-400 dataset, suitable for video action recognition tasks.
Video Processing Transformers
T
onnx-community
17
0
Bge Reranker V2 M3 Onnx O4
Apache-2.0
The ONNX O4 version of BGE-RERANKER-V2 is an optimized text reordering model that supports relevance scoring for multilingual text pairs.
Text Classification Transformers
B
hooman650
39
5
Depth Anything V2 Base
Depth-Anything-V2-Base is an ONNX-format depth estimation model adapted for Transformers.js, designed for image depth estimation on the web.
3D Vision Transformers
D
onnx-community
56
0
Lakshyakh93 Deberta Finetuned Pii Onnx
Apache-2.0
This is the ONNX-converted version of the lakshyakh93/deberta_finetuned_pii model, designed to identify Personally Identifiable Information (PII) in text.
Sequence Labeling Transformers English
L
protectai
1,817
1
Musicgen Small
MusicGen Small is a Transformer-based music generation model capable of producing high-quality music clips from text descriptions.
Audio Generation Transformers
M
Xenova
5,434
24
Yolov9 C All
Gpl-3.0
Object detection model based on YOLOv9, adapted for Transformers.js, capable of running in a browser
Object Detection Transformers
Y
Xenova
176
2
Bge M3 Onnx
MIT
BGE-M3 is an embedding model that supports dense retrieval, lexical matching, and multi-vector interaction, converted to ONNX format for compatibility with frameworks like ONNX Runtime.
Text Embedding Transformers
B
aapot
292
29
Bge M3 Onnx O4
MIT
This is the ONNX quantized version of the BAAI/bge-m3 model, supporting three functionalities: dense retrieval, multi-vector retrieval, and sparse retrieval, covering over 100 languages.
Text Embedding Transformers
B
hooman650
285.96k
10
Gte Base Onnx
Apache-2.0
GTE-Base is a general-purpose text embedding model capable of converting text into high-dimensional vector representations, suitable for text classification and similarity search tasks.
Text Embedding Transformers
G
Qdrant
31
3
Xlm Roberta Base Language Detection ONNX
A multilingual detection model based on XLM-RoBERTa, capable of identifying the language category of text.
Text Classification Transformers
X
Oblix
16
1
Chinese Clip Vit Base Patch16
Chinese CLIP model based on ViT architecture, supporting multimodal understanding of images and text
Text-to-Image Transformers
C
Xenova
264
1
Hubert Base Superb Ks
A voice command recognition model based on the HuBERT architecture, optimized for keyword spotting tasks
Audio Classification Transformers
H
Xenova
17
1
Multilingual E5 Small Onnx
Apache-2.0
This is a multilingual sentence transformer model that maps text to a dense vector space, supporting semantic search and clustering tasks
Text Embedding English
M
nixiesearch
96
1
Xlm Roberta Base Language Detection Onnx
MIT
This is the ONNX format conversion of the papluca/xlm-roberta-base-language-detection model, designed for multilingual text classification tasks, supporting detection in 20 languages.
Text Classification Transformers Supports Multiple Languages
X
protectai
6,535
6
Deberta V3 Base Injection Onnx
MIT
This is the ONNX-converted version of the deepset/deberta-v3-base-injection model for detecting prompt injection attacks.
Text Classification Transformers English
D
protectai
30
2
Nougat Base
Nougat is a vision-based academic document understanding model capable of converting scientific PDF images into Markdown-formatted text.
Image-to-Text Transformers
N
Xenova
24
3
Swin2sr Classical Sr X4 64
A classical image super-resolution model based on Swin2SR architecture, capable of upscaling image resolution by 4 times
Image Enhancement Transformers
S
Xenova
19
0
E5 Large V2 Onnx
Apache-2.0
This is a sentence transformer model that maps sentences and paragraphs into a dense vector space, suitable for tasks such as clustering and semantic search.
Text Embedding English
E
nixiesearch
114
0
Whisper Large V2 Onnx Int4 Inc
Apache-2.0
Whisper is a pre-trained automatic speech recognition (ASR) and speech translation model, trained on 680,000 hours of labeled data, demonstrating strong generalization capabilities. This repository contains the INT4 weight-only quantized version of the Whisper large v2 model in ONNX format.
Speech Recognition Transformers
W
Intel
19
27
Whisper Large Onnx Int4 Inc
Apache-2.0
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. This repository provides the Whisper large model in ONNX format with INT4 weight quantization, powered by Intel® Neural Compressor and Intel® Transformers Extension.
Speech Recognition Transformers
W
Intel
44
8
Bge Large En V1.5 Quant
MIT
Quantized (INT8) ONNX variant of BGE-large-en-v1.5 with inference acceleration via DeepSparse
Text Embedding Transformers English
B
RedHatAI
1,094
22
Clip Vit Large Patch14
OpenAI's open-source CLIP model, based on Vision Transformer (ViT) architecture, supporting joint understanding of images and text.
Text-to-Image Transformers
C
Xenova
17.41k
0
Wav2vec2 Large Xlsr 53 English
Large-scale speech recognition model based on the wav2vec 2.0 architecture, supporting English speech-to-text conversion
Speech Recognition Transformers
W
Xenova
14
2
Clip Vit Base Patch32
CLIP model developed by OpenAI, based on Vision Transformer architecture, supporting joint understanding of images and text
Text-to-Image Transformers
C
Xenova
177.13k
8
Clip Vit Base Patch16
OpenAI's open-source CLIP model, based on Vision Transformer architecture, supporting cross-modal understanding of images and text
Text-to-Image Transformers
C
Xenova
32.99k
9
Sbert All MiniLM L6 With Pooler
Apache-2.0
This is an ONNX-converted model based on sentence-transformers/all-MiniLM-L6-v2, capable of mapping sentences and paragraphs into a 384-dimensional dense vector space, suitable for tasks like clustering or semantic search.
Text Embedding Transformers English
S
vamsibanda
28
0
Vit Base Patch16 224
Apache-2.0
Image classification model based on Transformer architecture, pre-trained and fine-tuned on ImageNet-21k and ImageNet-1k datasets
Image Classification Transformers
V
optimum
40
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase